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EXTRACELLULAR E3CPRESSION OF CELLULOSE BINDING DOMAINS (CBD) US- 
ING BACILLUS 

5 The present invention relates to a transformed Bacillus host 

capable of expressing a cellulose binding domain polypeptide, a 
Bacillus expression vector, and a method for producing a 
cellulose binding domain in a Bacillus host cell. 

10 BACKGROUND OF THE INVENTION 

Focus on the CBD as a functional domain has involved the 
synthesis of the domain as a single domain molecule. 
One of the first pure CBD*s was obtained as synthesized by 
automated solid phase synthesis (Kraulis P. et al. (1989) . 

15 It has been shown that CBDs can be expressed in E.coli as 

functional single domains, see e.g.: Ong E. et al. (1993), 
wherein it is disclosed that expression using E.coli results in 
a yield of 33 mg CBD per litre of culture fluid in the perip- 
lasma of the cells. 

20 Recently, a double fungal CBD (a dimer) has also succes- 

fully been expressed in coli, see Linder M. Et al. (1996). 

However, the expression of CBD's in E. coli is not a true 
extracellular expression and results in an unsatisfactory yield 
which is too low for industrial scale production of CBD. 

25 US 5,525,195, US 5,536,655, WO 91/17244 and WO 91/10732 

discloses expression in a Bacillus host cell of an 
endoglucanase enzyme which has the catalytically active domain 
operably linked to a cellulose binding domain. 

Accordingly, it is the object of the present invention to 

30 provide a method for producing CBD in a high yield, preferably 
by means of a conventional fermentation technique involving 
extracellular production of the CBD which in turn makes the use 
of CBD in industrial applications economically feasible. 

35 SUMMARY OF THE INVENTION 

The inventors have now found that it is possible to 
produce cellulose binding domains (CBDs) by expression in a 
Bacillus host. 
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Before the present invention, expression of a CBDs in 
Bacillus was highly unexpected, since, firstly, cellulose 
binding domains are known to contain disulfide bridges and, 
secondly, are potentially susceptible to degradation by 
5 proteases produced by the Bacillus host. 

Accordingly, in its first aspect the present invention 
relates to a Bacillus host transformed with a vector comprising 
a DNA sequence encoding for a cellulose binding domain and 
capable of expressing the DNA sequence. 
10 In a second aspect, the invention relates to a Bacillus 

expression vector which carries an inserted DNA sequence encod- 
ing for a cellulase binding domain. 

Further, in its third aspect, the present invention re- 
lates to a method for producing a cellulose binding domain 
IS polypeptide in a Bacillus host cell, the method comprising the 
steps of 

- growing under conditions to overproduce cellulose binding do- 
main in a nutrient medium Bacillus host cells which have been 
transformed with an expression cassette which includes, as op- 
20 erably joined components, 

a) a transcriptional and translational initiation regulatory 
region, 

b) a DNA sequence encoding the cellulose binding domain 
polypeptide , 

25 c) a transcriptional and translational termination regulatory 
region, wherein the regulatory regions are functional in the 
host , and 

d) a selection marker gene for selecting transformed host 
cells; and 

30 - recovering the cellulose binding domain polypeptide. 

DETAILED DESCRIPTION OF THE INVENTION 

A cellulose binding domain (CBD) is a polypeptide which 
has high affinity for or binds to water-insoluble forms of 
35 cellulose and chitin, including crystalline forms. 

CBDs are found as integral parts of large protein 
complexes consisting of two or more different polypeptide 
domains, for example in hydro lytic enzymes (hydrolases) which 
typically are composed of a catalytic domain containing the 
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active site for substrate hydrolysis, and a carbohydrate- 
binding domain or cellulose-binding domain (CBD) for binding to 
the insoluble matrix. Such enzymes can comprise more than one 
catalytic domain and one, two or three CBDs and optionally one 
5 or more polypeptide regions linking the CBD(s) with the 

catalytic domain (s) , the latter regions usually being denoted a 
"linker". Examples of hydrolytic enzymes comprising a CBD are 
cellulases, xylanases, mannanases, arabinof uranosidases , acetyl 
esterases and chitinases* CBDs have also been found in algae, 
10 e.g. the red alga Porphyra purpurea as a non-hydro lytic 

polysaccharide-binding protein, see Peter Tomme et al.(1996). 
However, most of the known CBDs are from cellulases and 
xylanases. 

In this context, the term "cellulose-binding domain" is 
15 intended to be understood as defined by Tomme et al., op. cit. 
This definition classifies more than 120 cellulose-binding 
domains into 10 families (I-X) which may have different 
functions or roles in connection with the mechanism of 
substrate binding « However, during the work resulting in the 
20 present invention a hitherto unknown CBD family has been found, 
cf • example 8 below; and it is anticipated that new family 
representatives and additional CBD families will appear in the 
future. 

In the protein complex, typically a hydrolytic enzyme, a 
25 CBD is located at the N or C termini or is internal. 

A monomeric CBD typically consists of more than about 30 
and less than about 250 amino acid residues. For example, a CBD 
classified in Family I consists of 33-37 amino acid residues; a 
CBD classified in Family Ila consists of 95-108 amino acid 
30 residues; and a CBD classified in Family VI consists of 85-92 
amino acid residues. Accordingly, the molecular weight of a 
monomeric CBD will typically be in the range of from about 4kD 
to about 4 0kD, and usually below about 35kD. 

CBDs may be useful as a single domain polypeptide or as a 
35 dimer, a trimer, or a polymer; or as a part of a protein 
hybrid. 
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Chimeric protein hybrids 

Chimeric protein hybrids are known in the art, see e.g. 
wo 90/00609, wo 94/24158 and WO 95/16782, and comprise a 
5 cellulose binding domain (CBD) from another origin, preferably 
from another microbial origin, than the chimeric protein as 
such, which CBD exists as an integral part of the protein. 
Typically, the chimeric protein hybrids are enzyme hybrids, 
i.e. contain a catalytic domain together with the binding 
10 domain. 

Chimeric protein hybrids and enzyme hybrids can be 
prepared by transforming into a host cell a DNA construct 
comprising at least a fragment of DNA encoding the cellulose- 
binding domain (CBD) ligated, with or without a linker, to a 
15 DNA sequence encoding the protein or enzyme and growing the 
host cell to express the fused gene. The recombinant fusion 
protein or enzyme hybrids may be described by the following 
formula: 

20 CBD - MR - X 

wherein CBD is the N-terminal or the c-terminal region of an 
amino acid sequence corresponding to at least the cellulose- 
binding domain; MR is the middle region (the linker) , and may 

25 be a bond, or a short linking group preferably of from about 2 
to about 100 carbon atoms, more preferably of from 2 to 40 
carbon atoms; or is preferably from about 2 to to about 100 
amino acids, more preferably of from 2 to 40 amino acids; and X 
is an N-terminal or C-terminal region of a polypeptide encoded 

30 by the DNA sequence encoding the protein or enzyme. 

However, recombinant fusion protein or enzyme hybrids 
having an internal CBD are also contemplated. 

A DNA sequence encoding a CBD from a given organism can 
be obtained conventionally by using PGR techniques, and, also 

35 based on current knowledge, it is possible to find homologous 
sequences from other organsisms. 

It is contemplated that new CBDs can be found by cloning 
cellulases, xylanases or other plant cell wall degrading enzyme 
and measure the binding to cellulose. If the enzyme activity is 
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bound to Avicel under the standard condtions described below, 
it can be assumed that part of the gene codes for a binding do- 
main. 

Having obtained the DNA fragment coding for a CBD the DKA 
5 gene is inserted in a vector suitable for its expression in Ba^ 
cillus spp. 

For example, cellulose affinity can be measured by using 
10 g of Avicel in a 500 ml buffered slurry (buffer: 0.1 sodium 
phosphate, pH 7.5) which is stirred slowly using a spoon and 

10 left swelling for 30 minutes at room temperature. Then the en- 
zyme is added in a ratio of 1 part cellulose binding domain to 
150 parts Avicel. This is done on ice which gives optimum bind- 
ing within 5 to 10 minutes. The Avicel can then be washed and 
applied directly to SDS-PAGE for visualization of the bound 

15 proteins (since the use of SDS and cooking will release the 
bound proteins) . Alternatively, the slurry is packed into a 
column and washed. The bound protein is eluted, either in ion- 
ized water or in a high pH buffer such as triethylaanine (pH 
11.2; 1% solution), where the pH eluted protein is (juickly ad- 

20 justed to neutral. 

Several CBD*s have been expressed in Em coll, however, 
none has ever been reported expressed and secreted from Bacil- 
lus sp, E,coli as an expression host for heterologues proteins 
has several advantages over Bacillus spp., firstly because 

25 E.coll has a periplasmic space where proper folding of het- 
erologues expressed genes are possible (for review see for ex- 
ample Hockney, R.C. (1994). Especially the oxidizing potential 
and the existence of disulfide oxidoreductases in the perip- 
lasma is necessary when expressing proteins with a functional- 

30 ity dependent on properly arranged disulfide bridges (Emmanuel 
Brun et al. (1995). Overproduction, purification and charac- 
terization of the cellulose binding domain of the Erwinia chry- 
santheml secreted endoglucanase EG2 is disclosed in Eur. J. 
Biochem 231, 142-148, and Ong et al., (1993). Further examples 

35 of CBDs with disulfide bonds are: the N-terminal CBD of CelB 

from PsBudomonas fluorescens subsp cellulosa (NCIMB 10462) (see 
the alignment in Tomme P. et al., op. cit., and the N-terminal 
CBD of CenA from Cellulomonas fimi (ATCC 484) , N.R. Gilkes et 
al. (1991). 
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Furthermore, the periplasma of E.coli also acts as in 
protecting the heterelogously expressed protein towards the ac- 
tion of proteases present in the supernatant as well as the cy- 
toplasm. 

5 It is also known that, when expressing secreted proteins 

with disulfide bridges in Bacillus subtilis, the level of ex- 
pression drops significantly (van den Berg et al.(1993))* 

Another problem with heterologue expression is the prote- 
olytic degradation of the expressed protein. Bacillus subtilis 
10 is known to express at least 7 different extracellular prote- 
ases (Eds. A.L. Sonenshein et al. (1993)}. 

Especially for CBDs which are highly hydrophobic pro- 
teins, the translocation of the protein when expressed in Ba- 
cillus subtilis could be severely hampered and even cause cell 
15 death due to deleterious effects if the protein gets anchored 
to the cell membrane because of its hydrophobic ity. 

In its first aspect, the present invention relates to a 
Bacillus host transformed with a vector comprising a DNA se- 
quence encoding for a CBD and capable of expressing the se- 
20 quence. Obviously, the expressed polypeptide consists essen- 
tially of one or more non-catalytical domains, i.e. the 
polypeptide does not comprise any catalytically active domain. 

In a preferred embodiment, the expressed CBD or CBD- 
containing polypeptide has a molecular weight (Mw) which is 
25 equal to or higher than about 4 kD. Preferably, the Mw is equal 
to or below about 35 kD, more preferably about 32 kD, even more ^ 
preferably about 30 kD, especially about 25 kD. 

The CBD may be expressed in the form of a single domain 
polypeptide, i.e. a polypeptide comprising one CBD. Alterna- 
30 tively, the CBD may be expressed in the form of a dimer or 
trimer or even a polymer, i.e. a polypeptide or protein com- 
prising two, three, or even more than three identical CBD 
"units". 

The CBD can also be expressed as a part of a multidomain 
35 polypeptide, the non-CBD part of such a polypeptide being for 
example one, two or even more domains without catalytic activ- 
ity. 

It is believed that almost any CBD can be expressed ac- 
cording to the present invention, i.e. by means of a trans- 
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formed Bacillus host. Preferably, such CBDs are expressed which 
are obtainable from a microorganism or a plant, more preferably 
from a bacterium or from a fungus. 

Examples of CBDs from bacteria include CBDs obtainable 
5 from species belonging to one of the following genera: 

Butyrivibrio, CGllulomonas^ Clostridium, Microbispora, Micro- 
monospora, Pseudomonas, streptomyces, Thermomonospora, Bacil'^ 
lus, Caldocellum, Erwinia, Myxococcus, CBllvibrio, Thermoan^ 
aerobacterium, and Thermotoga. 
10 Examples of CBDs from fungi include CBDs obtainable from 

species belonging to one of the following genera: 
Agrajrlcus, Dictyostelium, Fusarium, Humicola, Neocallimastix, 
Neurospora, Limulus, Penicillium, PhanerochaBtB, and Tricho- 
dBrma • 

15 Examples of CBDs obtainable from plants are CBDs from ex- 

pansins. 

The Bacillus host of the present invention is a neutralo- 
philic, a an alkalophilic, a mesophllic, or a thermophilic 
host • 

20 Examples of hosts which are useful in the present inven* 

tion are hosts from the species Bacillus subtilis. Bacillus li- 
chBniformis, Bacillus ntBgatBrium, Bacillus stBaroth&rmophilos , 
and Bacillus amyloliquBfaciBns, However, it is contemplated 
that other Bacillus species may also be useful hosts for ex- 

25 pression of CBDs 

As described in further detail below, the host of the in- 
vention is transformed with a vector comprising a CBD encoding 
DNA sequence. Preferably, the vector is integrated into the 
genome of the host, more preferably it has been amplified on 

30 the genome. 

In another preferred embodiment of the invention, the 
vector is present as an expression plasmid, preferably as a 
multicopy plasmid. 

In a second aspect, the present invention relates to a 

35 Bacillus expression vector which carries an inserted CBD- 

encoding DNA sequence. Preferably, the expression cassette of 
the vector comprises regulatory regions from a Bacillus sp., 
more preferably are such regulatory regions endogenous to the 
host. 
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In a third aspect, the present invention relates to a 
method for producing a CBD polypeptide, the method comprising 
the steps of 

- growing under conditions to overproduce cellulose binding do- 
5 main in a nutrient medium Bacillus host cells which have been 

transformed with an expression cassette which includes, as op- 
erably joined components, 

a) a transcriptional and translational initiation regulatory 
region, 

10 b) a DNA sequence encoding the cellulose binding domain 
polypeptide, 

c) a transcriptional and translational termination regulatory 
region, wherein the regulatory regions are functional in the 
host, and 

15 d) a selection marker gene for selecting transformed host 
cells; and 

- recovering the cellulose binding domain polypeptide. 

In its fourth aspect, the present invention relates to a 
method for optimisation of CBD expression in a Bacillus host, 
20 the method comprising the steps of expression in the host of a 
CBD fused to a reporter molecule; and monitoring the concentra- 
tion of expressed CBD in the supernatant of the fermented host 
by measuring the intrinsic property or properties of the re- 
porter molecule. 

25 In a preferred embodiment, the reporter molecule is a 

Green Fluorescent Protein, and the intrinsic property is fluo- 
rescence emission. 

In its fifth and sixth aspect, the invention relates to a 
polypeptide hybride consisting essentially of one or more cel- 

30 lulose binding domain (s) fused to a green floturescent protein, 
and to a method of producing such a hybride by expression in a 
Bacillus host, growth of the transformed host under conditions 
whereby the transformed culture is substantially free of un- 
transformed cells; incubation of the transformed culture in a 

35 nutrient medium, whereby the hybride is overproduced; and re- 
covery of the hybride. 
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EXPRESSION OF A CBD 

Recombinant expression vectors 

A recombinant vector comprising a DNA construct encoding 
the CBD of the invention may be any vector which may conveni- 
ently be subjected to recombinant DNA procedures, and the 
choice of vector will often depend on the host cell into which 
it is to be introduced • This introduction of vector into the 
host cell is often referred to as the transformed host cell. 
Such transformation indicates introduction of DNA into a host 
cell by using e.g. protoplasts, natxxral competent cells, 
transf ection, conjugation, electroporation, or any equivalent 
method. Thus, the vector may be an autonomously replicating 
vector, i.e. a vector which exists as an extrachromosomal 
entity, the replication of which is independent of chromosomal 
replication, e.g. a plasmid. Alternatively, the vector may be 
one which, when introduced into a host cell, is integrated into 
the host cell genome in part or in its entirety and replicated 
together with the chromosome (s) into which it has been 
integrated • 

The vector is preferably an expression vector in which 
the DNA sequence encoding the CBD of the invention is operably 
linked to additional segments required for transcription of the 
DNA. In general, the expression vector is derived from plasmid 
or viral DNA, or may contain elements of both. The term, 
"operably linked" indicates that the segments are arranged so 
that they function in concert for their Intended purposes, e.g. 
transcription initiates in a promoter and proceeds through the 
DNA sequence coding for the CBD. 

The promoter may be any DNA seG[uence which shows 
transcriptional activity in the host cell of choice and may be 
derived from genes encoding proteins either homologous or 
heterologous to the host cell. 

Examples of suitable promoters for use in bacterial host 
cells include the promoter of the Bacillus stearothermophilus 
maltogenic amylase gene, the Bacillus lichBjiiformis alpha- 
amylase gene, the Bacillus amyloliquefaciens alpha-amylase 
gene, the Bacillus subtilis alkaline protease gen, or the 
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Bacillus pumilus xylosidase gene, or the phage Lambda Pr or 
promoters or the E. coli lac, trp or tac promoters. 
Alternatively, it is possible to design integration vectors 
such that the DNA encoding the CBD will only become 
5 functionally expressed once it is properly integrated into the 
host genome, e.g. downstream from a resident promoter. 

The DNA sequence encoding the CBD of the invention may 
also, if necessary, be operably connected to a suitable 
terminator . 

10 The recombinant vector of the invention may further 

comprise a DNA sequence enabling the vector to replicate in the 
host cell in question. 

The vector may also comprise a selectable marker, e.g. a 
gene the product of which complements a defect in the host 

15 cell, or a gene encoding resistance to e.g. antibiotics like 
kanamycin, chloramphenicol , erythromycin, tetracycline, 
spectinomycine, or the like, or resistance to heavy metals or 
herbicides. 

To direct an CBD of the present invention into the secre- 

20 tory pathway of the host cells, a secretory signal sequence 
(also known as a leader sequence, prepro secpience or pre 
sequence) may be provided in the recombinant vector. The 
secretory signal sequence is joined to the DNA sequence 
encoding the CBD in the correct reading frame. Secretory signal 

25 sequences are commonly positioned 5 ' to the DNA sequence 
encoding the. The secretory signal sequence may be that 
normally associated with the CBD or may be from a gene encoding 
another secreted protein. 

The procedures used to ligate the DNA sequences coding 

30 for the present CBD, the promoter and optionally the terminator 
and/or secretory signal sequence, respectively, or to assemble 
these sequences by suitable PCR amplification schemes, and to 
insert them into suitable vectors containing the information 
necessary for replication or integration, are well known to 

35 persons skilled in the art (cf., for instance, Sambrook et al., 
op.cit. \ . 

Green Flourescent Protein (GFP) has become a widely used 
reporter molecule for monotoring gene expression, tracers of 
cell lineage and as fusion tags for proteins (Crameri et al. 
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(1996) ; Cubitt et al. (1995) ; International Patent Application 
PCT/DK96/00051) . 

GFP could be fused to CBD*s creating a fusion protein 
having the cellulose binding property as well as the f loures- 
5 cent properties. The expression of this fusion protein could be 
used to monitor the expressing of CBD's in Bacillus species and 
hereby be used to optimize expression levels of given CBD*s. 

EXAHPLE8 

10 

MATERIALS AND METHODS 
Strains: 

Bacillus agaradherens NCIMB No. 40482 comprises the 
15 endoglucanase enzyme encoding DNA sequence of example 8« 
E.colii SJ2 (Diderichsen, B. et al. (1990)) 
Electrocompetent cells prepared and transformed using a Bio-Rad 
GenePulser™ as recommended by the manufacturer. 

B. subtilis PL2306. This strain is the J3. suhtilis DN1885 
20 (Diderichsen, B. et al. (1990)) disrupted in the 

transcriptional unit of the known Bacillus subtilis cellulase 
gene, resulting in cellulase negative cells. Furthermore the 
strain was disruptedin the aprE and nprE genes (aprE: Stahl and 
Ferrari (1984)) and {nprE: Yang et al (1984)). The disruptions 
25 were performed essentially as described in (Eds. Sonenshein et 
al. (1993) , p. 618) . 

B. subtilis PL2304. This strain is the. B. subtilis DN1885 
(Diderichsen, B., op. Cit.) disrupted in the transcriptional 
unit of the known Bacillus subtilis cellulase gene, resulting 
30 in cellulase negative cells. The disruption was performed 
essentially as described in (Eds. A.L. Sonenshein, op cit.) 
B. subtilis ToC46 (Diderichsen, B. et* al., op.cit.)^ 

Plasmids: 

35 pMBlOO, which is a derivative of pDN1528 (S.J0rgensen et 

al. (1991). The plasmid is essentially the saune as pDN1528, 
however a SacI sites was for cloning purposes introduced be- 
tween the stop codon of the amyL gene and its terminator. 
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pDN1981 (P.L. J0rgensen et al. (1990)} 
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Solutions /Media 

TV and LB agar (as described in Ausubel, F. M. et al*, 

5 1995) . 

SB: 32 g Tryptone, 20 g Yeast Extract, 5 g NaCl and 5 ml 
1 N NaOH are mixed in sterile water to a final volume of 1 li- 
ter. The solution is sterilised by autoclaving for 20 min at 
121*C. 

10 10% Avicel: 100 g of Avicel (FLUKA, Switserland) is mixed 

with sterile water to a final volume of l litre, and the 10% 
Avicel is sterilised by autoclaving for 20 min at 121 *C. 

Stock solution of Congo red (SIGMA, USA) . 1% in ionized 
water • 

15 Buffer: 0.1 M potassium phosphate, pH 7.5. 

General molecular biology methods: 

DNA manipulations and transformations were performed 
using standard methods of molecular biology (Sambrook et al. 
20 (1989) Molecular cloning: A laboratory manual. Cold Spring 
Harbor lab.. Cold Spring Harbor, NY; Ausubel, F. M. et al., 
1995; Harwood and Cutting, 1990). 

Enzymes for DNA manipulations were used according to the 
specifications of the suppliers. 

25 

EXAMPLES 1*3 

Isolation of genomic DNA 

Cellulomanas flmi ATCC484 was grown in TY at 30 *C, 250 
30 rpm for 24 hours, cells were harvested by centrif ugation. 

Clostridium stercorarium NCIMB 11754 was grown anaerobi- 
cally at 60 'C in specified media as recommended by The National 
Collections of Industrial and Marine Bacteria Ltd. (Scotland) . 
Cells were havested by centrif ugation. 
35 Pseudomonas flourescens ssp cellulosa NCIMB 10462 was 

grown on TY agar plates for 24 hours at 30 'C. Cells were 
scraped of for isolation of genomic DNA. 
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Froxa any of the mentioned species, genomic DNA was iso- 
lated as described by Pitcher et al. (1989) • 

Identification of Cellulose Binding Domains present in Glycosyl 
s hydrolases. 

Cellulose Binding Domains are classified in ten families 
according to their amino acid sequences, see Tomme et al. op. 
cit. Based on the disclosure in this review article three po- 
tentially different CBD sequences were choosen as models for 

10 expression purposes in B.subtilis: 

From the family Ila the CBD of Cellulomonas fimi (ATCC 
484) cellulase CenA (GenBank and SWISS-PROT Accession No. 
H15823) and the CBD of Pseudomonas flourescens (NCIMB 10462) 
CelB (GenBank and SWISS-PROT Accession No. X52615) were chosen. 

15 From the family VI the CBD-dimer of Clostridium stsrco^ 

rarium (NCIHB 11754) XynA (GenBank and SWISS-PROT Accession 
No. 13325) was chosen. 

The SWISS-PROT data obtained describe the position of the 
putative Cellulose Binding Domains, which informations were 

20 used to specifically designe PCR primers to obtains the DNA 

fragments encoding the CBD's from the three different bacteria. 

At the same time PCR primers were designed as to add ex- 
tra codons corresponding to amino acids preceding the signal 
sequence of amyL which is used to direct the CBD*s to the exte- 

25 rior of the Bacillus subtilis cell. 

in vitro amplification of the CBD of cellulomonas fimi (ATCC 
484) cellulase CenA 

30 Approximately 100 to 200 ng of genomic DNA was PCR 

amplified in PCR buffer (10 mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 
mM MgCl2# 0.01 % (w/v) gelatin) containing 200 /LtM of each dNTP, 
l,5%DMSO ( SIGMA, USA) , 2.5 units of AmpliTaq polymerase (Per- 
kin-Elmer, Cetus, USA) and 100 pmol of each primer: 

35 

CELFIMOlU, 

5 • - CTG CCT CAT TCT GCA G CA GCG GCG GCA AAT CTT AAT GCT CCC GGC 
TGC CGC GTC GAC TAG -3* 
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CELFIMOID, 

5«-CTG OCT CAT TGC ATG C AG AGC TC C TAC TAG ACG GTG CCC GTG CAG 
GTG GTG -3« 

5 Restriction sites PstI and Sad are underlined. 

The PGR reactions was performed using a DNA thermal 
cycler (Landgraf , Germany) . One incubation at 94*^C for 5 min 
followed by thirty cycles of PGR performed using a cycle 
profile of denaturation at 94°C for 1 min, annealing at 65°C for 
10 1 min, and extension at 72 °C for 1 min. Ten-^l aliquots of the 
amplification product was analyzed by electrophoresis in 1-5 % 
agarose gels (NuSieve, FMC) with ReadyLoad lOObp DNA ladder 
(GibcoBRL, Denmark) as a size marker. 

15 In vitro amplification of the CBD of Pseudomonas flourescens 

(NCIMB10462) CelB 

Approximately 100 to 200 ng of genomic DNA was PGR 
amplified in HiFidelityTM PGR buffer (Boehringer Mannheim, 
Germany) supplemented with 200 /iM of each dNTP, 2.6 units of 
20 HiFidelityTM Expand enzyme mix and 300 pmol of each primer: 

PSUPPER, 

5t-cGT CCT CAT TCT GCA GC A GCG GCG GCA AAT CTT AAT GCA GCA GTG 
TGT GAA TAT CGG G -3* 

25 

PSLOWER, 

5 ■ -CTG CCT CAT TGC ATG C AG AGC TC C TAC TAT TGT CCA GCG CAA ATC 
GCC -3« 

Restriction sites PstI and SacI are underlined. 

30 The PGR reactions was performed using a DNA thermal 

cycler (Landgraf, Germany). One incubation at 94®C for 2 min, 
30 sec at 60®C and 45 sec at 72*^C followed by ten cycles of PGR 
performed using a cycle profile of denaturation at 94**C for 30 
sec, annealing at 60**C for 30 sec, and extension at 72**C for 

35 45sec and twenty cycles of denaturation at 94'*C for 3 0 sec, 60^C 
for 30 sec and 72*^C for 45 sec (at this elongation step 20 sec 
are added every cycle) . Ten-^l aliquots of the amplification 
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product was analyzed by electrophoresis in 1.5 % agarose gels 
(NuSieve, FMC) with ReadyLoad lOObp DNA ladder (GibcoBRL, 
Denmark) as a size marker. 

5 In vitro amplification of the CBD-dimer of oiQgtylii'Siiiii 
stercorarium fNCiMB 11754) XynA. 

Approximately 100 to 200 ng of genomic DNA was PGR 
amplified in HiFidelityTH PGR buffer (Boehringer Mannheim, 
10 Germany) supplemented with 200 fM of each dNTP, 2.6 units of 
HiFidelityTM Expand enzyme mix, and 300 pmol of each primer: 

GL0ST03U, 

5»- GTG CCT CAT TGT GCA G CA GCG GGG GCA AAT CTT AAT CCA ACT CCT 
15 GGG CGA TGT CAA AGC -3' 

GLOST03D2 , 

5«- GTG CCT CAT TGC ATG G AG AGC TG G TAG TAG GAG TCA ACA TTA ACA 
GGA CCT GAG-3» 

20 Restriction sites PstI and Sad are underlined. 

The PGR reactions was performed using a DNA thermal 
cycler (Landgraf , Germany) • One incubation at 94'^G for 2 min, 30 
sec at 60^C and 45 sec at 72*^C followed by ten cycles of PGR 
performed using a cycle profile of denatxiration at 94*'C for 30 

25 sec, annealing at 60C for 30 sec, and extension at 72^C for 

45sec and twenty cyles of denaturation at 94*'G for 30 sec, 60 *C 
for 30 sec and 72''C for 45 sec (at this elongation step 20 sec 
are added every cycle) .Ten-^l aliguots of the amplification 
product was analyzed by electrophoresis in 1.5 % agarose gels 

30 (NuSieve, FMC) with ReadyLoad lOObp DNA ladder (GibcoBRL, 
Denmark) as a size marker. 

Cloning by polymerase chain reaction (PCR) : 
Subcloning of PCR fragments. 

35 

Fourty-Ml aliquots of the PCR products generated as 
described above were purified using QIAquick PCR purification 
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kit (Qiagen, USA) according to the manufacturer's instructions. 
The purified DNA was eluted in 50 ^1 of lOnM Tris-HCl, pH 8. 5. 
Twenty five-/xl of the purified PCR fragment was digested with 
Sad and PstI, electrophoresed in 1.5% low gelling temperature 
5 agarose (SeaPlaque GTG, FMC) gels, the relevant fragments were 
excised from the gels, and purified using QIAquick Gel 
extraction Kit (Qiagen, USA) according to the manufacturer's 
instructions. The isolated DNA fragment was then ligated to 
PstI Sacl digested pMBlOO and the ligation mixture was used to 
10 transform B.subtllisi PL2306. 

Identification and charaterization of positive clones. 

Cells were plated on LB agar plates containing 

15 chloramphenicol (6 /zg/ml) , 0,4% glucose and lOmM potassivim 

hydrogen phosphate and incubated at 37®C over night. Next day 
colonies were restreaked onto fresh LBPG chloramphenicol agar 
plates and incubated at 37*^C over night. The next day single 
colonies of each clone were transferred to licjuid LB medium 

20 containing chloramphenicol (6 ^g/ml) and incubated overnight 
at 37^C with shaking at 250 rpm. 

Plasmids were extracted from the liquid cultures using 
QIAgen Plasmid Purification mini kit (Qiagen, USA) according to 
the manufacturer's instructions, however the resuspension 

25 buffer was supplemented with 1 mg/rol of Chicken Egg White 

Lysozyme (SIGMA, USA) prior to lysing the cells at 37°C for 15 
min* Five-Ml samples of the plasmids were digested with PstI 
and Sad. The digestions were checked by gelelectrophoresis on 
a 1.5% agarose gel (NuSieve, FMC). The appearence of a DNA 

30 fragment of the same size as seen from the PCR amplification 
indicated a positive clone. Three clones were selected each 
representing a CBD from the three different bacteria mentioned 
above: MB144 (expressing C. fimi CenA-CBD) , MB203 (expressing 
c.stercorarium XynA-dimer-CBD) and MB207 (expressing 

35 P.floiurescens ssp CBllulosa CelB-CBD) . 
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nucleotide sequencing the cloned DNA fragment 

Qiagen purified plasmid DNA was sequenced with the Taq 
5 deoxy terminal cycle sequencing kit (Perkin Elmer, USA) using 
the same primers as used above and using an Applied Biosystems 
37 3 A automated sequencer according to the manufacturers 
instructions. Analysis of the sequence data is performed accor- 
ding to Devereux et al . 

10 

Expression, secretion and functional analysis of the cloned 
CBDs 

The clones MB144 (expressing C. fimi CenA-CBD) , MB203 
15 (expressing c .stercorarium XynA-dimer-CBD) and MB207 

(expressing P . flourescens ssp cellulosa CelB-CBD) , where all 
incubated for 20 hours in SB-medium at 37**C and 250 rpm. 1 ml of 
cell-free supernatant was mixed with 200 m-I of 10% Avicel. The 
mixture was left for 1 hour incubation at 0®C. After this bind- 
20 ing of CBD to Avicel the Avicel with CBD was spun 5 min at 

5000g. The pellet was resuspended in 100 ill of SDS-page buffer, 
boiled at 95**C for 5 min, spun at 5000g for 5 min and 25 ^il was 
loaded on a 18% Laemmli Tris-Glycine, SDS-PAGE NOVEX gel 
(Novex, USA) . The samples were electrophoresed in a Xcell™ 
25 Mini-Cell (NOVEX, USA) as recommended by the manufacturer, all 
subsequent handling of gels including staining with comassie, 
destaining and drying were performed as described by the manu- 
facturer • 

The appearance of protein bands of the expected sizes 
30 (MB144 protein band approx. 12 kDa) , (MB203 protein band apx. 
35 kDa) and (MB207 protein band apx. 12 kDa) indicated expres- 
sion in B.subtilis of functional CBDs. 
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EXAMPLE 4 

5 Expression and purification of CBD-dimer cloned from 
G.stercorarium 

Plasmld Isolated from MB203 was used to transform another 
Bacillus subtilis ToC46, thus obtaining a new CBD-dimer ex- 
10 pressing clone MB206. Using this strain as the expression host 
for the CBD-dimer, the clone was incubated in shakeflasks con- 
taining SB media (6 ug/ml of chloramphenicol) for 20 hours, at 
37°C and shaking at 250 rpm. 

1400 ml of culture fluid supernatant was cooled on 
15 ice bath. It was filtrated through Whatman Glass filter F and 
then sterile filtrated through 0.45 micron millipore Type HVLP. 

50 gram of Avicel was suspended in 0.1 M Sodium Phosphate 
buffer, pH 7.5, at room temperatxire for 30 min. The supernatant 
was removed and the Avicel slurry was cooled to 4**C. The clear 
20 supernatant was mixed with the Avicel slurry at 4*'C for 30 min. 

The Avicel was settled for 10 min and the supernatant re- 
moved. The Avicel protein complex was packed in a column and 
washed with O.IM sodium phophate buffer, followed by buffer in- 
cluding 0.5M sodium chloride. Finally, the CBD was eluted by 
25 deionized water. 

A total of 78 ml was eluted containing CBD. The CBD was 
concentrated after addition of solid sodium chloride to a final 
concentration of 0.5M on an Amicon cell with a RBIP membrane 
with a cut off of 8 ko. 
30 The concentrated CBD solution (30 ml) had a absorbance at 

280 nm of 1*2 . The molar extinction coefficient of MB 206 was 
42000 corresponding to a protein concentration of 0.82, result- 
ing in a total of 25 mg of highly purified double CBD. Based on 
SDS-PAGE, the starting material had about 0.1 mg per ml of 29 
35 kD. The final purified product showed only a single band on 
SDS-PAGE. - 
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EXAMPLE 5 

Characterization of a dimerized Fungal CBD# cloned and ex*- 
B pressed in Baeillua aubtilis 

A CBD dimer of fungal origen is constructed by fusing the 
CBD encoded by the DNA sequence of Humicola insolens EGII with 
the CBD encoded by the DNA sequence of the 43 kDa from Humicola 
10 insolens. 

The DNA sequence encoding Hujnicola insolens EGII CBD and 
linker is PCR amplified from the plasmid carrying the cDNA of 
EG II also known as CMC 3 (Dalboge and Heldt Hansen, 1994) us- 
ing primers specific for the CBD region and furthermore the an- 

15 tisense primer is designed so as to give the PCR fragment an 
overhang identical to the DNA fragment encoding the proceeding 
CBD, the CBD encoded by the gene of the 43 kDa endoglucanase 
from H. insolens which is described in detail in EP-B-0 531 372 
and US 5,457,046. The DNA encoding this CBD is PCR amplified 

20 from genomic DNA of the Humicola insulens described in EP-B-0 
531 372. 

The two fragments are combined by SOE-PCR (Higuchi et al. 
(1988)) using the primers : 

25 #22857 

5'- CTG CCT CAT TCT GCA GCA GCG GCG GCA AAT CTT AAT CAG GGC GGT 
GCA TGG CAG CAG-3 • 

and the primer 
30 #20622 

5'- CTG CCT CAT TGC ATG CAG AGC TCC TAC TAC AGG CAC TGA TGG TAC 
CAG TC-3« 

This PCR fragment is, as a Pstl-SacI fragment, ligated to 
35 pMBlOO and the ligation mixture is used to transform Bacillus 
subtilis PL2306. 

The Cloned DNA essentially encoding the CBD-dimer can be 
found in the sequence CBD-EGII-CZ (327bp) : 
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GCAAATCTTA ATCAGGGCGG TGCATGGCAG CAGTGTGGTG GCGTTGGCTT 
CTCGGGCTCT ACGTCCTGTG TGTCCGGTTA CACGTGCGTG TACTTGAACG ACTGGTA- 
CAG CCAATGCCAG CCGCAGCCGA CGACGTTACG GACAACAACA ACGCCA6GGG 
CAACATCGAC AACAAGGTCA GCCCCGGCTG CCACTTCAAC CACTCCGGCC 
5 GGCTGCACTG CTGAGAGGTG GGCTCAGTGC GGCG6CAATG GCTGGAGCGG CTGCAC- 
CACC TGCGTCGCTG GCAGCACTTG CACGAAGATT 
AATGACTGGT ACCATCAGTG CCTGTAG 

and the corresponding amino acid sequence (108 aa residues) : 
10 ANLNQGGAWQ QCGGVGFSGS TSCVSGYTCV YLNDWYSQCQ PQPTTLRTTT 

TPGATSTTRS APAATSTTPA GCTAERWAQC GGNGWSGCTT CVAGSTCTKI NDWYHQCL 

Expression, secretion and functionality of the CBD is 
characterized as described above. 

15 

EXAMPLE 6 

Construction of 6FP-CBD fusion for CBD expression optimization. 

Approximately ICQ ng of plasmid DNA pMB144, plasmid is 
20 isolated as described above, is PCR amplified in PCR buffer (10 
mM Tris-HCl, pH 8.3, 50 mM KCl, 1.5 mM MgCl2, 0.01 % (w/v) 
gelatin) containing 200 iM of each dNTP, l,5%DMSO (SIGMA, USA) , 
2.5 units of AmpliTag polymerase (Perkin-Elmer, Cetus, USA) and 
100 pmol of each primer: 

25 

C-Fusionl: . 

5 ' -GTC AGT GAA TTC GCA TGC GTC CTT CTT TGT GOT TG-3 • 
C-Fusion2 : 

30 5*-CTC ATA AAG CTT ACG GTG CCC GTG CAG GTG GTG-3 • 

Restriction sites EcoRI and Hindlll are underlined. 

The PCR reactions is performed using a DNA thermal 
cycler (Landgraf ^ Germany) . One incubation at 94®C for 5 min 
35 followed by thirty cycles of PCR performed using a cycle 

profile of denaturation at 94**C for 1 min, annealing at 60**C for 
1 min, and extension at 72''C for 1 min. Ten-zil aliquots of the 
amplification product is analyzed by electrophoresis in 0.7 % 
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agarose gels (NuSleve, FMC) with ReadyLoad lOObp DNA ladder 
(GibcoBRL, Denmark) as a size marker. 

The fragment is purified, digested with EcoRI and Hindlll, 
gelpurified and ligated to vector pBR322 (Bolivar et al-(1977), 

5 Gene, 2, 95-113.) • 

The ligation mixture is used to transform SJ2 electrocom- 
petent E.coli. 

Identification and charaterisatlon of positive clones. 

10 

The transformed cells are plated on LB agar plates 
containing ampicillin (200 §JLg/ml) and incubated at 37®C 
overnight. Next day colonies are rescued by restreaking these 
onto fresh LB-ampicillin agar plates and incubated at 37*C over 

15 night. The second day single colonies of each clone are 

transferred to liquid LB medium containing ampicillin (200 

fig /ml) and incubated overnight at 37^C with shaking at 250 rpm. 

Plasmids are extracted from the liquid cultures using QIAgen 
Plasmid Purification mini kit (Qiagen, USA) according to the 

20 manufacturer's instructions. Five-Mi samples of the plasmids 

are digested with Hlndlll and EcoRI. The digestions are checked 
by gel electrophoresis on a 0.7% agarose gel (NuSieve, FMC). 

A derivative of GFP is cloned from the DNA construction of 
the mutant F64L-S65T-*GFP which was constructed as described in 

25 international patent application PCT/DK96/00051. 

The DNA fragment encoding the F64L-S65T-GFP is cloned as a 
BamHI-Hindlll fragment. In-frame with the CBD encoding DNA 
cloned in pBR322. Ligation, transformation and identification 
of a positive clone is done essentially as described above. 

30 This fusion construction is transferred as a EcoRI-BsunHI 

fragment from the E.coli vector to the vector pUBllO vector 
(Gryczan et al. (1978)). Bacillus subtilis PL2306 is trans- 
formed and positive clones are identified by ther ability to 
fluoresce and by the existence of an Avlcel binding F64L-S65T- 

35 GFP CBD fusion polypeptide. 

The wavelength of the light used for excitation of the 
F64L-S65T-GFP of this study is 488 nm, this activates the F64L- 
S65T-GFP to emit light at 510-530 nm. 
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The f lourescence of the supernatant: is measured by fluo- 
rescence spectroscopy and compared with the f lourescence of the 
supernatant after incubation with Avicel. Furthermore, the 
f lourescent molecule with CBD can be visualized by binding the 
5 fusion protein to Avicel, removing excess supernatant and 

transferring the Avicel to cuvettes for f lourescent measuring 
in a fluorescence spectrometer. 

By making serial dilutions of the Avicel bound or non- 
bound fusion protein, the expression level can be determined, 
10 thus making it possible to identify a Bacillus clone expressing 
relatively higher amounts of CBD. 

EXAMPLE 7 

15 Screening using CMC-CongoRed 

Recombinant Bacillus clones expressing CBD*s can be 
screened by means of the expression level of the CBD. 

In order to find Bacillus strains optimal for expressing 
a given CBD, the clones of interest are incubated in a suitable 

20 medium e.g. as described above in TY and incubated at appropri- 
ate growth conditions for 24 hours. Supernatant of the clones 
are transferred to Agarose-CMC-CongoRed-plates with punched 
holes, the supernatant with the CBD are left to bind to the CMC 
for 5 hours at 37°C. When washed 15 min with 2% NaCl solution, 

25 the CBD activity can be seen as a clearing zone. 

The plate assay can be combined as described below. 
Preparation of the gel for use in CBD plate assay: 0.5% 
CMC and 0,7% agarose (CMC; Carboxyroethylcellulose, 7LF from 
Hercules) (agarose; Litex HSA/HSB) are prepared by moisting the 

30 CMC and Agarose with 96% alcohol. 0.1 M potassium phosphate pH 
7.5 buffer is added and the mixture is heated to lOO^C until 
completely dissolved. The solution is left to cool at 60°C. 
Congo red stock solution is added to a final 5% and plates are 
poured, 15 ml to a petri dish with 9 cm diameter. 

35 Sample application holes are made with a puncher. 
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EXMIPLB 8 

Identification of a novel CBD defining a new CBD family 

5 

The alkaline cellulase cloned iin Bacillus subtilis as 
described below was expressed by incubating the clone for 20 
hours in SB-medium at 37 *C and 250 rpm. The expressed cellulase 
was shown to contain a CBD by its ability to specifically bind 
10 to Avicel. 

When left for incubation for a further 20 hours the cel- 
lulase was proteolytically cleaved and two specific protein 
bands appeared on SDS-page one corresonding to the catalytic 
part of the cellulase approximate molecular weight (MW) 35 kD 

15 and the other corresponding to a proposed linker and CBD of 
approximate MW 8 kD. 

The CBD was found to be the C-terminal part of the cellu- 
lase and the CBD did not match any of the previously described 
CBD families (Tomme et al., 1995, p, 142-161). Accordingly, 

20 this CBD is the first member of a new family. 

Cloning of the alkaline cellulase from Bacillus aaaradherens 
and expressio n of the alkaline endoaluoanase in Bacillua sub- 
tilis 

25 

The nucleotide sequence encoding the alkaline cellulase 
from Bacillus agaradherens (Deposition No. Deposition No. NCIMB 
40482) was cloned by PCR for introduction in an expression 
plasmid pDN1981. 

30 PCR was performed essentially as described above on 500 

ng of genomic DNA, using the following two primers containing 
Ndel and Kpnl restriction sites for introducing the 
endoglucanase encoding DNA sequence to pDN1981 for expression: 

35 Primer 5: (#20887) 

5 ■ -GTA GGC TCA G TC ATA TGT TAG ACA TTG AAA GGG GAG GAG AAT CAT 
GAA AAA GAT AAC TAC TAT TTT TGT CG-3 • 
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Primer 6: (#21318) 

5»-GTA CCT CGC G GG TAG CA A GCG GCC GCT TAA TTG AGT GGT TCC CAC 
5 GGA CCG-3» 

After PGR cycling the PGR fragment was purified using QIAquick 
PGR coulmn Kit (Qiagen, USA) according to the manufacturer's in- 
structions • The purified DNA was eluted in 50 /il of lOmM Tris- 
10 HCl, pH 8,5. Digested with Ndel and Kpnl purified and ligated to 
digested pDN1981* The ligation mixture was used to transform 
B.subtxlis PL2304. Competent cells were prepared and transformed 
as described by Yasbin et al., (1975). 

15 Isolation and test of Baoillua subtilis transformants. 

The transformed cells were plated on LB agar plates containing 
10 mg/ml Kanamycin, 0.4% glucose, 10 mM KH2P04 and 0.1% AZCL H£- 
cellulose (Megazyme, Australia) and incubated at 37 .^C for 18 

20 hours « Endoglucanase positive colonies were identified as colonies 
surrounded by a blue halo. 

Each of the positive transformants were inoculated in 10 ml 
TY-medium containing 10 mg/ml Kanamycin. After l day of incubation 
at 37 *C, 250rpm, 50 ml supernatant was removed. The endoglucanase 

25 activity was identified by adding 50 ml supernatant to holes punc- 
tured in the agar of LB agar plates containing 0.1% AZGL HE- 
cellulose. 

After 16 hoxirs incubation at 37 'C blue halos surrounding holes 
indicated expression of the endoglucanase in Bacillus subtilis* 

30 

EXAMPLE 9 

Assay for selecting CBDs 

35 Preparation of phosphoric acid swollen cellulose CPASC^ : 

5 g Avicel is moistened with water and added 150 ml ice 
cold 85 % phosphoric acid and is weakly stirred on an icebath 
for 1 hour. Then 500 ml cold acetone is added while stirring* 
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The swollen Avicel (PASC) is filtered on a glass-filter funnel 
and washed 3 times with 100 ml ice cold acetone and subse- 
quently 2 times with 500 ml water. The PASC is then suspended 
in 500 ml water and blended to homogeneity using an Ultra Tho- 
5 rax homogenizer. The PASC is stored cold. 

CBD^'bindina to phoBPhoric acid swollen cellulose fPASC) - se** 
lection of CBDs : 

10 400 ml 10 mg/ml PASC (prepared as described above and 

washed with 50 mM sodixim phosphate, pH 7) in 50 mM sodium phos- 
phate, pH 7 in Eppendorf tubes was mixed with 400 ml of cellu- 
lose binding domain (Cel5A CBD or MB206 double CBD) diluted in 
50 mM sodium phosphate, pH 7. The concentration of CBD was var- 

15 ied e.g. from 0 mM to around 8 mM for Cel5A CBD* A control se- 
ries was included without PASC. The samples were incubated for 
1 hour at room temperature before centrifuging the samples for 
4 minutes at 14000 g. 500 /il of the supernatant was diluted 
into 2 ml water. The amount of CBD present in the supernatant 

20 (free CBD) was then measured by tryptophan fluorescence spec- 
troscopy on the Perkin-Elmer LS50 luminescence spectrometer 
(excitation at 280 nm and emission at 340 nm) using the fluo- 
rescence intensity of the samples without PASC addition as a 
reference (standard curve) . The amount of bound CBD was then 

25 calculated as: total CBD (without PASC addition) - free cbd. 

Thus, a binding isotherm was obtained by plotting the amount of 
bound CBD per g of PASC as a function of free CBD in solution 
in mM as shown in Figure 1 and Figure 2. The data could be fit- 
ted using a simple Langmuir binding model (Bothwell et al., 

30 1995): E(bound) = (Ainax*S( free) )/ (Xd+E( free) ) , where E(bound) is 
the amount of bound CBD in mmol/g PASC and E(free) is the 
amount of free CBD in mM. A^ax is the maximum amount of CBD that 
can be bound to PASC and is the equilibrium constant for the 
equilibrium E( bound) « £(free) . Thus, the lower the Ka 

35 (desorption constant) the stronger the binding. These constants 
are obtained after fitting the data to the model using algo- 
rithms in GraphPad Prizm. Desorption constants found for Cel5A 
CBD and MB206 double CBD are 0.42 and 0.76 mM respectively (cf . 
figure 1 and figure 2) . 
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CBDs of the present invention show desorption constants 
below 1 mM more preferably below 0.1 mM and inost preferably be- 
low 10 niM. 
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CLAIMS 

1. A Bacillus host transformed with a vector comprising a DNA 
sequence encoding for a cellulose binding domain (CBD) and ca- 

5 pable of expressing said sequence, the expressed polypeptide 
consisting essentially of one or more non-catalytical domains. 

2. The host according to claim 1, wherein the DNA sequence is 
of another origin than Bacillus ssp. 

10 

3. The host according to claim 1 or 2 which is capable of ex- 
pressing the cellulose binding domain as a single polypeptide 
domain. 

15 4. The host according to any of the claims 1-3, wherein the 

cellulose binding domain has a molecular weight in the range of 
from 4 kD to 35 XD. 

5. The host according to claim 4, wherein the cellulose binding 
20 domain has a molecular weight not higher than 30 kD, preferably 

not higher than 28 kD, more preferably not higher than 25 kD. 

6. The host according to any of the claims 1-5, wherein the 
vector comprises a DNA sequence encoding for a single cellulose 

25 binding domain. 

7. The host according to any of the claims 1-5, wherein the 
vector comprises a DNA sequence encoding for a dimeric or a 
trimeric cellulose binding domain. 

30 

8. The host according to any of the claims 1--5, wherein the 
vector comprises a DNA sequence encoding for a cellulose bind- 
ing domain which is linked to at least one other non- 
catalytically active domain. 

35 

9. The host according to any of the claims 1-8, wherein the 
cellulose binding domain is obtainable from a microorganism or 
from a plant, preferably a bacterium or a fungus. 
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10. The hos-t according "to claim 9, wherein the bacterium is se- 
lected from the group consisting of the genera Butyrivibrio^ 
CBllulomonas , Clostridium, Microbisporst, Micromonospora, Pseu- 

5 domonas, streptomyces, Thermomonospora, Bacillus, Caldocellum, 
Erwxnia, Myxococcus, Cellvibrio, ThBrmoanaerobacterium, and 
Therjnotoga . 

11. The host according to claim 9, wherein the fungus is se- 
10 lected from the group consisting of the genera Agaricus, Die- 

tyostelium, Fusarium, Humicola, Neocallimastix, Neurospora, 
Limulus, Penicillium, PhanerochaBte, and Trichoderma. 

12. The Bacillus host according to any of the claims 1-11 which 
15 is neutralophilic, alkalophilic, mesophilic or thermophilic. 

13. The Bacillus host according to claim 12 which is selected 
from the group consisting of the species Bacillus subtilis, Ba-- 
cillus licheniformis. Bacillus megatBrium, Bacillus stBarothBr^ 

20 mophilos, and Bacillus amyloliquefaciBiis. 

14. The host according to any of the claims 1-13, wherein the 
vector is integrated into the genome of the untransformed host. 

25 15. The host according to any of the claims 1-14, wherein the 
vector is present as an expression plasmid. 

16. The host according to any of the claims 1-15, wherein the 
vector has been amplified on the genome or the expression plas- 

30 mid is a multicopy plasmid. 

17. A Bacillus expression vector which carries an inserted DNA 
sequence encoding for a cellulose binding domain. 

35 18. The vector according to claim 17 in which the expression 
cassette comprises regulatory regions from a Bacillus sp. 

19. The vector according to claim 18, wherein the Bacillus sp. 
regulatory regions are endogeneous to the host. 
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20. A method for producing a cellulose binding domain polypep- 
tide in a Bacillus host cell, the method comprising the steps 
of: 

5 - growing under conditions to overproduce cellulose binding do- 
main in a nutrient mediiun Bacillus host cells which have been 
transformed with an expression cassette which includes, as op- 
erably joined components, 

a) a transcriptional and translational initiation regulatory 
10 region, 

b) a DNA sequence encoding the cellulose binding domain 
polypeptide, 

c) a transcriptional and translational termination regulatory 
region, wherein the regulatory regions are functional in the 

15 host, and 

d) a selection marker gene for selecting transformed host 
cells; and 

- recovering the cellulose binding domain polypeptide. 

20 21. The method according to claim 20 wherein the produced cel- 
lulose binding domain polypeptide has a molecular weight in the 
range of from 4 kD to 35 kD. 

22. A method for optimization of CBD expression in a Bacillus 
25 host, the method comprising the steps of: 

a. expression in the host of a CBD fused to a reporter mole- 
cule; 

b. monitoring the concentration of expressed CBD in the super- 
natant of the fermented host by measuring the intrinsic prop- 

30 erty or properties of the reporter molecule. 

23. The method according to claim 22, wherein the reporter 
molecule is a Green Fluorescent Protein, and the intrinsic 
property is fluorescence emission. 

35 

24. A polypeptide hybride consisting essentially of one or more 
cellulose binding domain (s) fused to a green flourescent pro- 
tein. 
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25. A method of producing the hybride according to claim 24, 
wherein the hybride is expressed in a Bacillus host, growing 
the transformed host under conditions whereby the transformed 
5 culture is substantially free of untransf ormed cells; incubat 
ing the transformed culture in a nutrient medium, whereby the 
hybride is overproduced; and recovering the hybride. 
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